<210> INFORMATION FOR SEQ ID NO:1

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: DNA coding sequence for HIV capsid protein Pr55 

<400> SEQUENCE DESCRIPTION: SEQ ID NO:1



ATGGGTGCGA GAGCGTCGGT ATTAAGCGGG GGAGAATTAG ATAAATGGGA AAAAATTCGG 60
TTAAGGCCAG GGGGAAAGAA AAAATATAAG TTAAAACATA TAGTATGGGC AAGCAGGGAG 120
CTAGAACGAT TCGCAGTCAA TCCTGGCCTG TTAGAAACAT CAGAAGGCTG CAGACAAATA 180
TTGGGACAGC TACAGCCATC CCTTCAGACA GGATCAGAAG AACTTAGATC ATTATATAAT 240
ACAGTAGCAA CCCTCTATTG TGTACATCAA AGGATAGATG TAAAAGACAC CAAGGAAGCT 300
TTAGAGAAGA TAGAGGAAGA GCAAAACAAA AGTAAGAAAA AGGCACAGCA AGCAGCAGCT 360
GCAGCTGGCA CAGGAAACAG CAGCCAGGTC AGCCAAAATT ACCCTATAGT GCAGAACCTA 420
CAGGGGCAAA TGGTACATCA GGCCATATCA CCTAGAACTT TAAATGCATG GGTAAAAGTA 480
GTAGAAGAAA AGGCTTTCAG CCCAGAAGTA ATACCCATGT TTTCAGCATT ATCAGAAGGA 540
GCCACCCCAC AAGATTTAAA CACCATGCTA AACACAGTGG GGGGACATCA AGCAGCCATG 600
CAAATGTTAA AAGAGACTAT CAATGAGGAA GCTGCAGAAT GGGATAGAGT GCATCCAGTG 660
CATGCAGGGC CTATTGCACC AGGCCAAATG AGAGAACCAA GGGGAAGTGA CATAGCAGGA 720
ACTACTAGTA CCCTTCAGGA ACAAATAGGA TGGATGACAA ATAATCCACC TATCCCAGTA 780
GGAGAAATCT ATAAAAGATG GATAATCCTG GGATTAAATA AAATAGTAAG AATGTATAGC 840
CCTACCAGCA TTCTGGACAT AAGACAAGGA CCAAAGGAAC CCTTTAGAGA TTATGTAGAC 900
CGGTTCTATA AAACTCTAAG AGCCGAACAA GCTTCACAGG ATGTAAAAAA TTGGATGACA 960
GAAACCTTGT TGGTCCAAAA TGCAAACCCA GATTGTAAGA CTATTTTAAA AGCATTGGGA 1020
CCAGCAGCTA CACTAGAAGA AATGATGACA GCATGTCAGG GAGTGGGGGG ACCCGGCCAT 1080
AAAGCAAGAG TTTTGGCTGA AGCCATGAGC CAAGTAACAA ATCCAGCTAA CATAATGATG 1140
CAGAGAGGCA ATTTTAGGAA CCAAAGAAAG ACTGTTAAGT GTTTCAATTG TGGCAAAGAA 1200
GGGCACATAG CCAAAAATTG CAGGGCCCCT AGGAAAAAGG GCTGTTGGAG ATGTGGAAGG 1260
GAAGGACACC AAATGAAAGA TTGCACTGAG AGACAGGCTA ATTTTTTAGG GAAGATCTGG 1320
CCTTCCTACA AGGGAAGGCC AGGGAATTTT CTTCAGAGCA GACCAGAGCC AACAGCCCCA 1380
CCAGAAGAGA GCTTCAGGTT TGGGGAGGAG AAAACAACTC CCTCTCAGAA GCAGGAGCCG 1440
ATAGACAAGG AACTGTATCC TTTAACTTCC CTCAGATCAC TCTTTGGCAA CGACCCCTCG 1500
TCACAATAAG GATAGGGGGG CAACTAAAGG AAGCTCTATT AGATACAGGA GCAGATGATA 1560
CAGTATTAGA AGAAATGAAT TTGCCAGGAA AATGGAAACC AAAAATGATA 1610

<210> INFORMATION FOR SEQ ID NO: 2

SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: PRT 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: peptide fragment of host cell (wheat germ) protein 

HP68 

<400> SEQUENCE DESCRIPTION: SEQ ID NO: 2

Pro Arg Pro Tyr Leu Asp Val Lys Gln Arg Leu Lys Ala Ala Arg Val
1 5 10 15

Ile Arg Ser Leu Leu Arg Ser Asn
20

<210> INFORMATION FOR SEQ ID NO: 3 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: Degenerate oligonucleotide C-terminal peptide 
sequence of WGHP68 

<400> SEQUENCE DESCRIPTION: SEQ ID NO: 3

ATGAATTCAC TGGGACTGCG GATAGATTAC TGGTACTGGG GATC 44

<210> INFORMATION FOR SEQ ID NO: 4 
SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: Degenerate oligonucleotide C-terminal peptide 
sequence of WGHP68 

<400> SEQUENCE DESCRIPTION: SEQ ID NO: 4
ATGAATTCAC TGGGCTCTGA TAGATTACTG GTACTGGGGA TC 42 



<210> SEQ ID NO: 5 
<211> Length: 604 
<212> Type: PRT 

<213> Organism: Triticum aestivum
<400> Sequence 5 

Met Ala Asp Arg Leu Thr Arg Ile Ala Ile Val Ser Glu Asp Lys Cys
1 5 10 15

Lys Pro Lys Lys Cys Arg Gln Glu Cys Lys Lys Ser Cys Pro Val Val
20 25 30

Lys Thr Gly Lys Leu Cys Ile Glu Val Ser Pro Val Ala Lys Leu Ala
35 40 45

Phe Ile Ser Glu Glu Leu Cys Ile Gly Cys Gly Ile Cys Val Lys Lys
50 55 60

Cys Pro Phe Asp Ala Ile Glu Ile Ile Asn Leu Pro Lys Asp Leu Glu
65 70 75 80

Lys Asp Thr Thr His Arg Tyr Gly Pro Asn Thr Phe Lys Leu His Arg
85 90 95

Leu Pro Val Pro Arg Pro Gly Gln Val Leu Gly Leu Val Gly Thr Asn
100 105 110

Gly Ile Gly Lys Ser Thr Ala Leu Lys Val Leu Ala Gly Lys Leu Lys
115 120 125

Pro Asn Leu Gly Arg Phe Lys Asn Pro Pro Asp Trp Gln Glu Ile Leu
130 135 140

Thr Tyr Phe Arg Gly Ser Glu Leu Gln Asn Tyr Phe Thr Arg Ile Leu
145 150 155 160

Glu Asp Asn Leu Lys Ala Ile Ile Lys Pro Gln Tyr Val Asp His Ile
165 170 175

Pro Lys Ala Val Gln Gly Asn Val Gly Gln Val Leu Glu Gln Lys Asp
180 185 190

Glu Arg Asp Met Lys Asn Glu Leu Cys Val Asp Leu Glu Leu Asn Gln
195 200 205

Val Ile Asp Arg Asn Val Gly Asp Leu Ser Gly Gly Glu Leu Gln Arg
210 215 220

Phe Ala Ile Ala Val Val Ala Val Gln Ser Ala Glu Ile Tyr Met Phe
225 230 235 240

Asp Glu Pro Ser Ser Tyr Leu Asp Val Lys Gln Arg Leu Lys Ala Ala
245 250 255

Arg Val Ile Arg Ser Leu Leu Arg Ser Asn Ser Tyr Val Ile Val Val
260 265 270

Glu His Asp Leu Ser Val Leu Asp Tyr Leu Ser Asp Phe Ile Cys Cys
275 280 285

Leu Tyr Gly Lys Pro Gly Ala Tyr Gly Val Val Thr Leu Pro Phe Ser
290 295 300

Val Arg Glu Gly Ile Asn Ile Phe Leu Ala Gly Phe Val Pro Thr Glu
305 310 315 320

Asn Leu Arg Phe Arg Asp Glu Ser Leu Thr Phe Lys Ile Ala Glu Thr
325 330 335

Gln Glu Ser Ala Glu Glu Val Ala Thr Tyr Gln Arg Tyr Lys Tyr Pro
340 345 350

Thr Met Ser Lys Thr Gln Gly Asn Phe Lys Leu Ser Val Val Glu Gly
355 360 365

Glu Phe Thr Asp Ser Gln Ile Val Val Met Leu Gly Glu Asn Gly Thr
370 375 380

Gly Lys Thr Thr Phe Ile Arg Met Leu Ala Gly Leu Leu Lys Pro Asp
385 390 395 400

Thr Met Glu Gly Thr Glu Val Glu Ile Pro Glu Phe Asn Val Ser Tyr
405 410 415

Lys Pro Gln Lys Ile Ser Pro Lys Phe Gln His Pro Val Arg His Leu
420 425 430

Leu His Ser Lys Ile Arg Asp Ser Tyr Thr His Pro Gln Phe Val Ser
435 440 445

Asp Val Met Lys Pro Leu Gln Ile Glu Gln Leu Met Asp Gln Glu Val
450 455 460

Ile Asn Leu Ser Gly Gly Glu Leu Gln Arg Val Ala Leu Cys Leu Cys
465 470 475 480

Leu Gly Lys Pro Ala Asp Ile Tyr Leu Ile Asp Glu Pro Ser Ala Tyr
485 490 495

Leu Asp Ser Glu Gln Arg Ile Val Ala Ser Lys Val Ile Lys Arg Phe
500 505 510

Ile Leu His Ala Lys Lys Thr Ala Phe Ile Val Glu His Asp Phe Ile
515 520 525

Met Ala Thr Tyr Leu Ala Asp Lys Val Ile Val Tyr Glu Gly Leu Ala
530 535 540

Ser Ile Asp Cys Thr Ala Asn Ala Pro Gln Ser Leu Val Ser Gly Met
545 550 555 560

Asn Lys Phe Leu Ser His Leu Asp Ile Thr Phe Arg Arg Asp Pro Thr
565 570 575

Asn Tyr Arg Pro Arg Ile Asn Lys Leu Glu Ser Thr Lys Asp Arg Glu
580 585 590

Gln Lys Asn Ala Gly Ser Tyr Tyr Tyr Leu Asp Asp
595 600

<210> SEQ ID NO: 6
<211> LENGTH: 599
<212> TYPE: PRT
<213> ORGANISM: Homo sapiens
<400> SEQUENCE 6

Met Ala Asp Lys Leu Thr Arg Ile Ala Ile Val Asn His Asp Lys Cys
1 5 10 15

Lys Pro Lys Lys Cys Arg Gln Glu Cys Lys Lys Ser Cys Pro Val Val
20 25 30

Arg Met Gly Lys Leu Cys Ile Glu Val Thr Pro Gln Ser Lys Ile Ala
35 40 45

Trp Ile Ser Glu Thr Leu Cys Ile Gly Cys Gly Ile Cys Ile Lys Lys
50 55 60

Cys Pro Phe Gly Ala Leu Ser Ile Val Asn Leu Pro Ser Asn Leu Glu
65 70 75 80

Lys Glu Thr Thr His Arg Tyr Cys Ala Asn Ala Phe Lys Leu His Arg
85 90 95

Leu Pro Ile Pro Arg Pro Gly Glu Val Leu Gly Leu Val Gly Thr Asn
100 105 110

Gly Ile Gly Lys Ser Ala Ala Leu Lys Ile Leu Ala Gly Lys Gln Lys
115 120 125

Pro Asn Leu Gly Lys Tyr Asp Asp Pro Pro Asp Trp Gln Glu Ile Leu
130 135 140

Thr Tyr Phe Arg Gly Ser Glu Leu Gln Asn Tyr Phe Thr Lys Ile Leu
145 150 155 160

Glu Asp Asp Leu Lys Ala Ile Ile Lys Pro Gln Tyr Val Ala Arg Phe
165 170 175

Leu Arg Leu Ala Lys Gly Thr Val Gly Ser Ile Leu Asp Arg Lys Asp
180 185 190

Glu Thr Lys Thr Gln Ala Ile Val Cys Gln Gln Leu Asp Leu Thr His
195 200 205

Leu Lys Glu Arg Asn Val Glu Asp Leu Ser Gly Gly Glu Leu Gln Arg
210 215 220

Phe Ala Cys Ala Val Val Cys Ile Gln Lys Ala Asp Ile Phe Met Phe
225 230 235 240

Asp Glu Pro Ser Ser Tyr Leu Asp Val Lys Gln Arg Leu Lys Ala Ala
245 250 255

Ile Thr Ile Arg Ser Leu Ile Asn Pro Asp Arg Tyr Ile Ile Val Val
260 265 270

Glu His Asp Leu Ser Val Leu Asp Tyr Leu Ser Asp Phe Ile Cys Cys
275 280 285

Leu Tyr Gly Val Pro Ser Ala Tyr Gly Val Val Thr Met Pro Phe Ser
290 295 300

Val Arg Glu Gly Ile Asn Ile Phe Leu Asp Gly Tyr Val Pro Thr Glu
305 310 315 320

Asn Leu Arg Phe Arg Asp Ala Ser Leu Val Phe Lys Val Ala Glu Thr
325 330 335

Ala Asn Glu Glu Glu Val Lys Lys Met Cys Met Tyr Lys Tyr Pro Gly
340 345 350

Met Lys Lys Lys Met Gly Glu Phe Glu Leu Ala Ile Val Ala Gly Glu
355 360 365

Phe Thr Asp Ser Glu Ile Met Val Met Leu Gly Glu Asn Gly Thr Gly
370 375 380

Lys Thr Thr Phe Ile Arg Met Leu Ala Gly Arg Leu Lys Pro Asp Glu
385 390 395 400

Gly Gly Glu Val Pro Val Leu Asn Val Ser Tyr Lys Pro Gln Lys Ile
405 410 415

Ser Pro Lys Ser Thr Gly Ser Val Arg Gln Leu Leu His Glu Lys Ile
420 425 430

Arg Asp Ala Tyr Thr His Pro Gln Phe Val Thr Asp Val Met Lys Pro
435 440 445

Leu Gln Ile Glu Asn Ile Ile Asp Gln Glu Val Gln Thr Leu Ser Gly
450 455 460

Gly Glu Leu Gln Arg Val Arg Leu Arg Leu Cys Leu Gly Lys Pro Ala
465 470 475 480

Asp Val Tyr Leu Ile Asp Glu Pro Ser Ala Tyr Leu Asp Ser Glu Gln
485 490 495

Arg Leu Met Ala Ala Arg Val Val Lys Arg Phe Ile Leu His Ala Lys
500 505 510

Lys Thr Ala Phe Val Val Glu His Asp Phe Ile Met Ala Thr Tyr Leu
515 520 525

Ala Asp Arg Val Ile Val Phe Asp Gly Val Pro Ser Lys Asn Thr Val
530 535 540

Ala Asn Ser Pro Gln Thr Leu Leu Ala Gly Met Asn Lys Phe Leu Ser
545 550 555 560

Gln Leu Glu Ile Thr Phe Arg Arg Asp Pro Asn Asn Tyr Arg Pro Arg
565 570 575

Ile Asn Lys Leu Asn Ser Ile Lys Asp Val Glu Gln Lys Lys Ser Gly
580 585 590

Asn Tyr Phe Phe Leu Asp Asp
595


